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WO 99/58702 PCT/US99/10519 

CELL-FREE CHIM ERAPL ASTY AND EUKARYOTIC USE 
OF HET ERODUPLEX MUTATIONAL VECTORS 

1 . FIELD OF THE INVENTION 

Chimeraplasty concerns the introduction of directed alterations in a specific site of 
the DNA of a target cell by introducing duplex oligonucleotides, which are processed by the 
cell's homologous recombination and error repair systems so that the sequence of the target 
DNA is converted to that of the oligonucleotide where they are different. The present 
invention concerns a chimeraplasty method that is practiced in a cell-free system. 
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2 BACKGROUND TO THE INVENTION 
2.1 Chimeraplasty 

Chimeraplasty in eukaryotic cells and duplex recombinagenic oligonucleotides for 
use therein arc disclosed in U.S. Patent No. 5,565,350, issued October 15. 1996, and No. 
5,731.181. issued March 24, 1998 by E.B. Kmiec (collectively "Kmiec"). The 
recombinagenic oligonucleotides disclosed by Kmiec contained ribo-type, e.g., 2'-0- 
methyl-ribonucleotides, and deoxyribo-type nucleotides that were hybridized to each other 
and were termed Chimeric Mutational Vectors (CMV). A CMV designed to repair a 
mutation in the gene encoding liver/bone/kidney type alkaline phosphatase was reported in 
Yoon, K., et al., 1996, Proc. Natl. Acad. Sci. 93, 2071. The alkaline phosphatase gene was 
transiently introduced into CHO cells by a plasmid. Six hours later the CMV was 
introduced. The plasmid was recovered at 24 hours after introduction of the CMV and 
analyzed. The results showed that approximately 30% to 38% of the alkaline phosphatase 
genes were repaired by the CMV. 

A CMV designed to correct the mutation in the human P-globin gene that causes 
Sickle Cell Disease and its successful use was described in Cole-Strauss, A., et al.. 1996, 
Science 273, 1386. A CMV designed to create a mutation in a rat blood coagulation factor 
IX gene in the hepatocyte of a rat is disclosed in Kren et al., 1998, Nature Medicine 4, 285- 
290. An example of a CMV having one base of a first strand that is paired with a non- 
complementary base of a second strand is shown in Kren et al., June 1997, Hepatology 25, 
30 1462. 

United States Patent Application Serial No. 08/640,517, filed May 1, 1996, by E.B. 
Kmiec, A. Cole-Strauss and K. Yoon, published as W097/41 141, November 6, 1997, and 
application Serial No. 08/906,265, filed August 5, 1997, disclose methods and CMV that 
are useful in the treatment of genetic diseases of hematopoietic cells, e.g., Sickle Cell 
Disease, Thalassemia and Gaucher Disease. 

An example of the use of a CMV having one base of a first strand that is paired with 
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a non-complementary base of a second strand is shown in Kren et al., June 1997, 
Hepatology 25, 1462. In Kren, the strand having the different desired, sequence was the 
strand having 2'-0-methyl ribonucleotides, which was paired with the strand having the 3' 
end and 5' end. U.S. Patent No. 5,565,350 described a CMV having a single segment of 2'- 
O-methylated RNA, which was located on the chain having the 5' end nucleotide. 

5 Applicants are aware of the following provisional applications that contain teaching 

with regard to chimeric mutational vectors: By Steer et al., Serial No. 60/045,288 filed 
April 30, 1997; Serial No. 60/054,837 filed August 5,1997; Serial No. No. 60/064,996, filed 
November 10, 1997; and by Steer & Roy-Chowdhury et aL, Serial No. 60/074,497, filed 
February 12, 1998, entitled "Methods of Prophylaxis and Treatment by Alteration of APO 

10 B and APO E Genes." 

2.2 Cell-Free Recombination 

Various reports of homologous recombination using a cell-free extract have been 
published. 

15 Hotta, Y., et al., 1985, Chromosoma 93, 140-151 report the use of an extract of 

yeast, mouse spermatocytes and Lilium to effect homologous recombination between two 
mutant pBR322 plasmids. One of the plasmids was supercoiled, the second plasmid could 
be linearized or supercoiled. The maximum rate of recombination was less than 1%. A 
similar experiment using mutant defective pSV2neo and extracts of EJ cells was reported in 
20 Kucherlapati, R.S. et aL 1985, Molecular and Cellular Biology 5, 714-720. The maximum 
rate of recombination was about 0.2%. Kucherlapati reported an absolute requirement that 
one of the mutant plasmids be linearized. In contrast Hotta, reported recombination 
between two circular plasmids, although the rate of recombination between circular and 
linear plasmids was higher. 
- 5 The report of Jessberger, R., & Berg, P., 1991, MoL & Cell. Biol. 11, 445 concerns 

recombination catalyzed by nuclear extracts between plasmids. It stands in contrast to both 
of the above in two respects. The rate of recombination reported was about 20%, in 
contrast to rates of less than 0.5%. In addition Jessberger observed the same rate of 
recombination between circularized plasmids as between a circularized and a linear 
^ plasmid. 

A related experiment using human nuclear extracts was reported by Lopez, B.S., et 
al., 1992, Nucleic Acids Research 20, 501-506. Lopez reported recombination in a cell- 
free system between a linearized plasmid and an unrelated supercoiled plasmid that is not 
viable in the subsequent selection conditions. The linearized and supercoiled plasmid each 
35 contain a lacZ gene; which is a mutant in the linearized plasmid. The linearized plasmid is 
cut in the lacZ. gene at a variable distance from the mutation. Homologous recombination 
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between the site of the mutation and the cut, accordingly, results in the circularization of the 
plasmid that then becomes viable and the gain of lacZ function. Lopez reports no detectable 
homologous recombination when the cut and the mutation were 15 base pairs apart. 
Homologous recombination at a low level was observed when that distance was 27 base 
pairs. No further increase in the rate of homologous recombination was observed when the 
distance was made greater than 165 base pairs. Lopez et al., 1987, Nucleic Acids Research 

2.3 RadSI and RadS2 Activity in Recombination 

Homologous recombination is the process whereby the genes of two chromosomes 
are exchanged. The rate of homologous recombination between two genetic loci is inversely 
proportional to their genetic linkage, tightly linked genes rarely recombine. In addition to its 
genetic function homologous recombination allows a somatic cell to repair DNA damaged 
by double strand breaks. 

The first step in homologous recombination is believed to be synapse formation. A 
synapse is a DNA molecule in which one chain is hybridized to two other chains. Synapse 
formation requires an enzymatic activity and energy input from ATP hydrolysis. An 
artifactual assay in a cell-free system for the enzymatic activity believed to be required for 
synapse formation is "strand transfer." In a typical strand transfer assay a circular single 
strand DNA is combined with a linear duplex to produce a "nicked" or relaxed circular 
duplex and a linear single strand. The RadSI gene from yeast, mice and humans has been 
cloned and catalyzes strand transfer. RadSI is believed to participate in synapse formation. 
Baumann, P., et al., 1996, Cell 87, 757-766; Gupta, R.C., 1997, Proc. Natl Acad. Sci. 94, 
463-468. The strand transfer activity is further enhanced by the presence of Rad52 protein 
and replication protein A. Baumann, P., & West, S.C., 1997, EMBO J. 16, 5198-5206; 
New, J.H., et al., 1998, Nature 391, 407-410; Benson, F.E., et al., 198, Nature 391, 401-404. 
Although RAD5 1 protein unlike Rec A binds to duplex DNA, Baumann & West op cit.\ 
Benson, F.E., et al., EMBO J., 13, 5764-5771, in the presence of RAD52, its binding is 
directed toward single stranded DNA. 

In yeast, RadSI or Rad52 defective individuals are radiation sensitive because of an 
inability to repair double strand breaks. In mice, RadSI knock out results in embryonic 
leathality. Tsuzuki, T., et al M Proc. Natl. Acad. Sci. 93, 6236-6240; Lin, S.D., & Hasty, 
P.A.,Mol. Cell. Biol., 16,7133. 

2.4 Cell-Free Mismatch Repair 

The intrinsic (thermodynamic) fidelity of DNA replication would lead to an 
unacceptably high rate of mutation without the presence of an "error correcting" 
mechanism. Mismatch repair is one such mechanism. In mismatch repair, duplex DNA 
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having a base paired to a non-complementary base is processed so that one of the strands is 
corrected. The process involves the excision of one of the strands and its resynthesis. 
Reports of mismatch repair in cell-free eukaryotic systems can be found in Muster-Nassal & 
Kolodner, 1986, Proc. Natl. Acad. Sci. 83, 7618-7622 (yeast); Glazer, P.M.. et al., 1987, 
Mol. Cell. Biol. 7, 218-224 (HeLacell); Thomas D.C., et al., 1991, J. Biol. Chem., 266, 
5 3744-3751 (HeLa cell); Holmes et al., 1991, Proc. Natl. Acad. Sci., 87, 5837-5841 (HeLa 
cell and Drosophila). The HeLa and Drosophila cell-free systems required that one strand 
of the mismatched duplex be nicked for full activity. By contrast, reports of repair in 
Xenopus egg extracts did not require that the mismatched duplex be nicked. Varlet, I., et al., 
1990, Proc. Natl. Acad. Sci. 87, 7883-7887. However, in Varlet the mismatch was repaired 
10 in a random fashion, i.e., the strands acted as templates with equal frequency. 

Many of the genes required for mismatch repair in yeast and humans have been 
cloned based on homology with the E. coli mismatch repair genes. Kolodner, R., 1996, 
Genes & Development 10, 1433-1442. Cells having defective mismatch repair genes show 
genetic instability, termed Replication Error (RER), particularly evident in microsatellite 
1 5 DNA, and malignant transformation. Extracts of RER cells do not have mismatch repair 
activity. Umar, A., et al., J. Biol. Chem. 269, 14367-14370. 

3. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. An example of the conformation of a double hairpin type recombinagenic 
20 oligomer. The features are: a, first strand; b, second strand; c, first chain of the second 
strand; 1, 5* most nucleobase; 2, 3' end nucleobase; 3, 5' end nucleobase; 4, 3' most 
nucleobase; 5, first terminal nucleobase; 6, second terminal nucleobase. 
Figure 2. An example of the conformation of a single hairpin type recombinagenic 
nucleobase with an overhang. The features are as above with the addition of d ? the 
25 overhang. Note that the same nucleobase is both the 5' most nucleobase of the second 
strand and the 5' end nucleobase. 

4. SUMMARY OF THE INVENTION 

Chimeraplasty is an increasingly important process for the treatment of human 
30 disease and the development of useful, genetically engineered plant and animal strains. The 
development of improved recombinagenic oligonucleotides has been greatly facilitated by 
the use of bacterial testing systems, which give rapid and quantitative results as described in 
commonly assigned regular U.S. patent application Serial No. to be assigned, entitled "Non- 
Chimeric Mutational Vectors" by R. Kumar et al., and provisional application Serial No. to 
35 be assigned, entitled "Heteroduplex Mutational Vectors and Use Thereof in Bacteria" by 
Kumar et al., (hereafter collectively "Kumar") filed on even date herewith, which are hereby 
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incorporated by reference in its entirety. The techniques of Kumar do not address whether 
the optimal recombinagenic oligonucleotides in bacterial systems are also optimal in 
eukaryotes. The prior art techniques of in vivo and cell-culture chimeraplasty are not 
designed for rapid quantitative analysis and are unable to utilize the same recombinagenic 
oligonucleobases and DNA targets as used in the bacterial systems. 

Accordingly, an objective of the present invention is an assay that can use DNA 
targets and recombinagenic oligonucleobases designed for bacterial systems to rapidly 
evaluate the compatibility between different types of recombinagenic oligonucleotides and 
the recombination and repair enzymes of different phyla, e.g., do the recombination and 
mismatch repair enzymes of bacteria, plants, insects and mammals have differing substrate 
preferences? 

A further objective of the invention is an assay that can rapidly determine whether a 
tissue or cell line is a target for chimeraplasty, i.e, whether it contains the requisite enzymes. 
A yet further objective is an assay to determine what agents or treatments can alter the level 
of chimeraplasty activity in a cell line or tissue. A yet further objective of the invention is 
an assay that can determine whether a compound is an agonist or antagonist of the 
recombination and repair pathway. An additional objective of the invention is a practical 
method of making specific genetic changes in a DNA sequence in a cell-free system that is 
an alternative to polymerase chain reaction PCR-based methods. 

The present invention meets these objectives by the unexpected discovery that 
chimeraplasty can be performed in a cell-free system. The components of the cell-free 
system are an enzyme mixture containing strand transfer activity and, optionally, a 
mismatch repair activity, a target DNA sequence and a recombinagenic oligonucleobase. 
The enzyme mixture can be made by obtaining a cell extract, or a mixture of recombinantly 
produced purified enzymes. The target DNA sequence is preferably a plasmid that can be 
used to transform an expression host such as a bacteria. In a preferred embodiment the 
plasmid is supercoiled. The recombinagenic oligonucleobase is any oligonucleotide or 
oligonucleotide derivative that can be used to introduce a site specific, predetermined 
genetic change in a cell. As used herein a DNA duplex consisting of more than 200 
deoxyribonucleotides and no nucleotide derivatives is not a recombinagenic 
oligonucleobase. Typically, a recombinagenic oligonucleobase is characterized by being a 
duplex nucleotide, including nucleotide derivatives or non-nucleotide interstrand linkers, 
and having between 20 and 120 nucleobases or equivalently between 10 and 60 Watson- 
Crick nucleobase pairs. In a preferred embodiment, the recombinagenic oligonucleobase is 
substantially a duplex and contains a single 3' end and 5' end; accordingly, the strands of 
the duplex are covalently linked by oligonucleobase or non-oligonucleobase linkers. A 
further embodiment of the present invention is based on the discovery that the Non- 
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Chimeric Mutational Vectors (NCMV), according to Kumar, are effective substrates for the 
strand transfer and repair enzymes of eukaryotic and. specifically mammalian cells. Yet 
further embodiments of the invention are based on the discovery that two types of 
recombinagenic oligonucleobases, according to Kumar, Heteroduplex Mutational Vectors 
(HDMV) and vectors having a single segment of ribo-type nucleobases in the strand 
opposite the strand containing the 3' end nucleobase and 5' end nucleobase, unexpectedly 
give superior results when used with eukaryotic and specifically in mammalian strand 
transfer and repair enzymes. The term Duplex Mutational Vectors (DMV) is used herein to 
refer to CMV, HDMV and NCMV, collectively. Note that a HDMV can be either chimeric 
or non-chimeric, however, the term CMV does not encompass HDMV. 
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5. DETAILED DESCRIPTION OF THE INVENTION 

According to the present invention a reaction is carried out in a reaction mixture 
containing an enzyme mixture comprising strand transfer and mismatch repair activities, a 
DNA target and a recombinagenic oligonucleobase. In one embodiment the DNA target is 
1 5 a mutated antibiotic resistance gene, e.g., let or mo (kan) of a plasmid and the 

recombinagenic oligonucleobase is a 2'-0-methyl containing a CMV according to Kmiec, at 
about a 1 :200 molar ratio. The function of the mutant tet or kan is restored by specific 
alteration of a single base. The reaction is terminated by phenol/chloroform extraction and 
the extracted plasmid electroporated into RecA or MutS defective bacteria. The extent of 
20 modification of the target DNA can be determined from the ratio of the recombinant (kan 
or tet r ) colonies to the parental type (amp r ). No recombinant colonies, above background, 
were observed when the plasmid and chimera were reacted separately and recombined after 
chloroform/phenol extraction. Recombinant colonies were reduced about 90% when 
extracts of mismatch repair deficient cells (LoVo) were used. These controls indicate that 
25 the modification, up to the point of mismatch excision is completed in the reaction mixture. 
The frequency of recombinant colonies was about 5 per 10 5 parental colonies using CMV 
of the type described in Kren et al. Nature Medicine, 1998, 4, 285-290 and Cole-Strauss et 
al., 1996, Science 273, 1386 (a "Cole-Straus CMV"). 

As used herein a cell-free enzyme mixture is deemed to have strand transfer and 
30 mismatch repair activity when the cell-free mixture can be used to obtain the above 
described result. 

Table I below shows the effects of multiple modifications of the Cole-Strauss CMV 
in both the bacterial and cell-free eukaryotic systems. There is a very good correlation 
between the activity of any modification measured in each system. In particular the 
35 substitution of 2'-0-methyl uracil for thymidine in the interstrand linkers (variants IV and 
V) , the placement of the mutator only in the 5' strand (variant VIb) and deletion of DNA 
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from the 3' strand significantly improved the performance of the recombinagenic 
oligonucleobases in both systems. 

In both systems the placement of the mutator in the 3' strand (variant Via) resulted 
in a substantial loss of function to below one in 10 5 recombinant colonies. The frequency 
observed with variant Via was clearly higher than background. Accordingly, as used herein 
a recombinagenic oligonucleobase is an oligonucleobase of the type that can provide a rate 
of recombination in the above cell-free system at least as high as a recombinagenic 
oligonucleobase made according to variant Via having the same mutator sequence. 

Variant VII with a one base mutator sequence was observed to effect recombination 
with a frequency of 4.4 / 10 5 . This frequency was significantly greater than that observed in 
the bacterial systems as well as that observed in cultured cells. Without limitation as to 
theory, this difference is believed to be due to the relative absence of exonucleases and 
endonucleases from the cell free system. 

5.1 The Cell-Free Enzyme Mixture 

The cell-free enzyme mixture for the practice of the invention contains the strand 
transfer and the mismatch repair activities. As used herein the term "cell-free enzyme 
mixture" indicates that the mixture excludes living cells, and preferably excludes the 
organelles, e.g., nuclei and mitochondria. The extent of the mismatch repair that is required 
in the cell-free enzyme mixture depends on the method used to detect the modification of 
the targeted DNA sequence and the utility. 

When the modification is detected by biochemical means, e.g., restriction 
endonuclease digestion, the mismatch repair activity will include mismatch detection, 
strand cutting and excision and strand resynthesis to fill the excision and ligation. When the 
modification is detected in a recombination defective bacteria, e.g., E. coli strain DH10, the 
strand resynthesis and ligation activities may be omitted from the cell-free enzyme mixture. 
As used herein "mismatch repair activity" does not include the resynthesis and ligation 
activities, which may be present in the cell-free enzyme mixture but are not required in most 
applications. 

In certain applications, e.g., to assay the effects of modifications of the 
recombinagenic oligonucleobase on its efficiency with plant or mammalian enzymes, it is 
preferred that the mismatch repair activity be provided by the cell-free enzyme mixture. 
Detection by biochemical means or in a host such as a MutS bacteria, e.g., NR91 62, which 
lack mismatch repair is preferred. 

For certain applications, it is desirable to separate the complex of target DNA and 
recombinagenic oligonucleobase from the uncomplexed target DNA. Separation can be 
readily accomplished by introducing an affinity ligand, e.g., a biotin, onto the 
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recombinagenic oligonucleobase. In such applications, two cell-free enzyme mixtures can 
be used, one before and one after the separation. The first mixture should contain only the 
strand transfer activity and the second need contain only the mismatch repair activity, 

The cell-free enzyme mixture can be obtained as a cell extract. A procedure of Li & 
Kelly can be used. Li., J.J., et alia., 1985, Mol. Cell. Biol. 5, 1238-1246. The Li & Kelly 

5 procedure is a "cytoplasmic extract." The cells are mechanically disrupted in hypotonic 
buffer and the supernatant from centrifugation of 10 min. at 2,000xg and twice of 15 min. 
at 12,000xg is used. Without limitation as to theory, it is believed that the physiological 
cellular location of the strand transfer and mismatch repair enzymes is the nucleus but that 
during preparation there is sufficient loss of these enzymes from the nucleus. Crude nuclear 

10 extracts made according to Dignam et al., 1983, Nucleic Acid Research 11, 1475 are not 
preferred. 

A cell-free enzyme mixture that lacks mismatch repair can be obtained from extracts 
of mutant cells having the replication error phenotype. Umar et al., 1994, J. Biol. Chem. 
269, 14367. The cell line LoVo has deleted both alleles of the human MutS homolog 
1 5 (MSH2) and is suitable as a source of strand transfer activity without mismatch repair 
activity. 

In an alternative embodiment the cell-free enzyme mixture can be a composition 
comprising recombinantly produced enzymes. The recombinant production of a defined 
enzyme allows for the addition of a known amount of the defined enzyme free of all other 
20 enzymes involved in the strand transfer and mismatch repair. When a defined enzyme is 
added to an extract from a cell that is deficient in that enzyme the result is a defined enzyme 
mixture with regard to that enzyme. The production of recombinant Rad5 1 can be 
accomplished by the methods reported by Gupta, R.C., 1997, Proc. Natl. Acad. Sci. 94, 
463-468. 

25 

5.2 The Recombinagenic Oligonucleobase 

Recombinagenic oligonucleobases for use in a cell-free system can be constructed 
according to the teaching of U.S. Patent No. No. 5,565,35 and No. 5,731,181. Additionally, 
recombinagenic oligonucleobases can be made according to the following. 
30 Definitions 

The invention is to be understood in accordance with the following definitions. 

An oligonucleobase is a polymer of nucleobases, which polymer can hybridize by 
Watson-Crick base pairing to a DNA having the complementary sequence. 

Nucleobases comprise a base, which is a purine, pyrimidine, or a derivative or 
35 analog thereof. Nucleobases include peptide nucleobases. the subunits of peptide nucleic 
acids, and morpholine nucleobases as well as nucleobases that contain a pentosefuranosyl 
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moiety, e.g., an optionally substituted riboside or 2'-deoxyriboside. Nucleotides are 
pentosefuranosyl containing nucleobases that are linked by phosphodiesters. Other 
pentosefuranosyl containing nucleobases can be linked by substituted phosphodiesters, e.g., 
phosphorothioate or triesterified phosphates. 

A olieonucleobase compound has a single 5* and 3' end nucleobase, which are the 

5 ultimate nucleobases of the polymer. Nucleobases are either deoxyribo-type or ribo-type. 
Ribo-tvpe nucleobases are pentosefuranosyl containing nucleobases wherein the 2 f carbon is 
a methylene substituted with a hydroxyl, substituted oxygen or a halogen. Deoxyribo-type 
nucleobases are nucleobases other than ribo-type nucleobases and include all nucleobases 
that do not contain a pentosefuranosyl moiety, e.g., peptide nucleic acids. 

1 0 An olieonucleobase strand generically includes regions or segments of 

oligonucleobase compounds that are hybridized to substantially all of the nucleobases of a 
complementary strand of equal length. An oligonucleobase strand has a 3' most (3' terminal) 
nucleobase and a 5' most (5* terminal) nucleobase. The 3' most nucleobase of a strand 
hybridizes to the 5' most nucleobase of the complementary strand. Two nucleobases of a 

1 5 strand are adjacent nucleobases if they are directly covalently linked or if they hybridize to 
nucleobases of the complementary strand that are directly covalently linked. An 
oligonucleobase strand may consist of linked nucleobases, wherein each nucleobase of the 
strand is covalently linked to the nucleobases adjacent to it. Alternatively a strand may be 
divided into two chains when two adjacent nucleobases are unlinked. The 5' (or 3*) 

20 terminal nucleobase of a strand can be linked at its 5'-0 (or 3'-) to a linker which linker is 
further linked to a 3' (or 5') terminus of a second oligonucleobase strand, which is 
complementary to the first strand, whereby the two strands form a single oligonucleobase 
compound. The linker can be an oligonucleotide, an oligonucleobase or other compound. 
The S'-O and the 3'-0 of a 5' end and 3* end nucleobase of an oligonucleobase compound 

25 can be substituted with a blocking group that protects the oligonucleobase strand. However, 
for example, closed circular oligonucleotides do not contain 3' or 5' end nucleotides. Note 
that when an oligonucleobase compound contains a divided strand the 3' and 5* end 
nucleobases are not the terminal nucleobases of a strand. 
Conformation: 

30 The Duplex Mutational Vectors (DMV) are comprised of polymers of nucleobases, 

which polymers hybridize, i.e., form Watson-Crick base pairs of purines and pyrimidines, to 
DNA having the appropriate sequence. Each DMV is divided into a first and a second 
strand of at least 12 nucleobases and not more than 75 nucleobases. In a preferred 
embodiment the length of the strands are each between 20 and 50 nucleobases. The strands 

^ contain regions that are complementary to each other. In a preferred embodiment the two 
strands are complementary to each other at every nucleobase except the nucleobases 
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wherein the target sequence and the desired sequence differ. At least two non-overlapping 
regions of at least 5 nucleobases are preferred. 

Nucleobases contain a base, which is either a purine or a pyrimidine or analog or 
derivative thereof. There are two types of nucleobases. Ribo-type nucleobases are 
ribonucleosides having a 2-hydroxyl, substituted 2'-hydroxyl or 2'-halo-substituted ribose. 
All nucleobases other than ribo-type nucleobases are deoxyribo-type nucleobases. Thus, 
deoxy-type nucleobases include peptide nucleobases. 

In the embodiments wherein the strands are complementary to each other at every 
nucleobase, the sequence of the first and second strands consists of at least two regions that 
are homologous to the target gene and one or more regions (the "mutator regions") that 
differ from the target gene and introduce the genetic change into the target gene. The 
mutator region is directly adjacent to homologous regions in both the 3' and 5' directions. 
In certain embodiments of the invention, the two homologous regions are at least three 
nucleobases, or at least six nucleobases or at least twelve nucleobases in length. The total 
length of all homologous regions is preferably at least 12 nucleobases and is preferably 16 
and more preferably 20 nucleobases to about 60 nucleobases in length. Yet more preferably 
the total length of the homology and mutator regions together is between 25 and 45 
nucleobases and most preferably between 30 and 45 nucleobases or about 35 to 40 
nucleobases. Each homologous region can be between 8 and 30 nucleobases and more 
preferably be between 8 and 15 nucleobases and most preferably be 12 nucleobases long. 

One or both strands of the DMV can optionally contain ribo-type nucleobases. In a 
preferred embodiment a first strand of the DMV consists of ribo-type nucleobases only 
while the second strand consists of deoxyribo-type nucleobases. In an alternative preferred 
embodiment the second strand is divided into a first and second chain. The first chain 
contains no ribo-type nucleobases and the nucleotides of the first strand that are paired with 
nucleobases of first chain are ribo-type nucleobases. In an alternative embodiment the first 
strand consists of a single segment of deoxyribo-type nucleobases interposed between two 
segments of ribo-type nucleobases. In said alternative embodiment the interposed segment 
contains the mutator region or, in the case of a HDMV, the intervening region is paired with 
the mutator region of the alternative strand. 

Preferably the mutator region consists of 20 or fewer bases, more preferably 6 or 
fewer bases and most preferably 3 or fewer bases. The mutator region can be of a length 
different than the length of the sequence that separates the regions of the target gene 
homology with the homologous regions of the DMV so that an insertion or deletion of the 
target gene results. When the DMV is used to introduce a deletion in the target gene there 
is no base identifiable as within the mutator region. Rather, the mutation is effected by the 
juxtaposition of the two homologous regions that are separated in the target gene. For the 
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purposes of the invention, the length of the mutator region of a DMV that introduces a 
deletion in the target gene is deemed to be the length of the deletion. In one embodiment 
the mutator region is a deletion of from 6 to 1 bases or more preferably from 3 to 1 bases. 
Multiple separated mutations can be introduced by a single DMV, in which case there are 
multiple mutator regions in the same DMV. Alternatively multiple DMV can be used 
simultaneously to introduce multiple genetic changes in a single gene or, alternatively to 
introduce genetic changes in multiple genes of the same cell. Herein the mutator region is 
also termed the heterologous region. When the different desired sequence is an insertion or 
deletion, the sequence of both strands have the sequence of the different desired sequence. 

The DMV is a single oligonucleobase compound (polymer) of between 24 and 150 
nucleobases. Accordingly the DMV contains a single 3' end and a single 5' end. The first 
and the second strands can be linked covalently by nucleobases or by non-oligonucleobase 
linkers. In a preferred embodiment the 3' terminal nucleobase of each strand is protected 
from 3* exonuclease attack. Such protection can be achieved by several techniques now 
known to these skilled in the art or by any technique to be developed. 

In one embodiment protection from 3'-exonuclease attack is achieved by linking the 
3' most (terminal) nucleobase of one strand with the 5' most (terminal) nucleobase of the 
alternative strand by a nuclease resistant covalent linker, such as polyethylene glycol, poly- 
1,3-propanediol or poly-l,4-butanediol. The length of various linkers suitable for 
connecting two hybridized nucleic acid strands is understood by those skilled in the art. A 
polyethylene glycol linker having from six to three ethylene units and terminal phosphoryl 
moieties is suitable. Durand, M. et al., 1990, Nucleic Acid Research 18, 6353; Ma, M. Y- 
X., et al., 1993, Nucleic Acids Res. 21, 2585-2589. A preferred alternative linker is bis- 
phosphorylpropyl-trans-4,4-stilbenedicarboxamide. Letsinger, R.L., et alia, 1994, J. Am. 
Chem. Soc. 116, 81 1-812; Letsinger, R.L. et alia, 1995. J. Am. Chem. Soc. 117, 7323-7328. 
Such linkers can be inserted into the DMV using conventional solid phase synthesis. 
Alternatively, the strands of the DMV can be separately synthesized and then hybridized 
and the interstrand linkage formed using a thiophoryl-containing stilbenedicarboxamide as 
described in patent publication WO 97/05284, February 13, 1997, to Letsinger R.L. et alia. 

In a further alternative embodiment the linker can be a single strand oligonucleobase 
comprised of nuclease resistant nucleobases, e.g., a 2'-0-methyl, 2 , -Oallyl or 2'-F 
ribonucleotides. The tetraribonucleotide sequences TTTT, UUUU and UUCG and the 
trinucleotide sequences TTT, UUU, and UCG are particularly preferred nucleotide linkers. 

In an alternative embodiment 3'-exonuclease protection can be achieved by the 
modification of the 3' terminal nucleobase. If the 3' terminal nucleobase of a strand is a 3* 
end, then a steric protecting group can be attached by esterification to the 3-OH, the 2'-OH 
or to a 2' or 3' phosphate. A suitable protecting group is a 1 ,2-((i>-amino)-alkyldiol or 
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alternatively a l,2-hydroxymethyl-(<o-amino)-alkyl. Modifications that can be made 
include use of an alkene or branched alkane or alkene, and substitution of the o-amino or 
replacement of the a>-amino with an o>-hydroxyl. Other suitable protecting groups include a 
3* end methylphosphonate, Tidd, D.M., et alia, 1989, Br. J. Cancer, 60, 343-350; and 3"- 
aminohexyl, Gamper H.G., et al., 1993, Nucleic Acids Res., 21, 145-150. Alternatively, the 

5 3' or 5* end hydroxyls can be derivatized by conjugation with a substituted phosphorus, e.g., 
a methylphosphonate or phosphorothioate. 

In a yet further alternative embodiment the protection of the 3 f -temiinal nucleobase 
can be achieved by making the 3-most nucleobases of the strand nuclease resistant 
nucleobases. Nuclease resistant nucleobases include peptide nucleic acid nucleobases and 

^ 2' substituted ribonucleotides. Suitable substituents include the substituents taught by 
United States Patent No. 5,731,181, and by U.S. Patent No. 5,334,71 1 (Sproat), which are 
hereby incorporated by reference, and the substituents taught by patent publications EP 629 
387 and EP 679 657 (collectively, the Martin Applications), which are hereby incorporated 
by reference. As used herein a T fluoro, chloro or bromo derivative of a ribonucleotide or a 

1 ^ ribonucleotide having a substituted 2'-0 as described in the Martin Applications or Sproat is 
termed a "2'-Substituted Ribonucleotide." Particular preferred embodiments of 2- 
Substituted Ribonucleotides are 2-fluoro, 2'-methoxy, 2'-propyloxy, 2 l -allyloxy, T- 
hydroxylethyloxy, 2'-methoxyethyloxy, 2 -fluoropropyloxy and 2-trifluoropropyloxy 
substituted ribonucleotides. In more preferred embodiments of 2'-Substituted 

10 

v Ribonucleotides are 2'-fluoro, 2-methoxy, 2-methoxyethyloxy, and 2 , -allyloxy substituted 
nucleotides. 

The term "nuclease resistant ribonucleoside" encompasses including 2-Substituted 
Ribonucleotides and also all 2'-hydroxyl ribonucleosides other than ribonucleotides, e.g., 
ribonucleotides linked by non-phosphate or by substituted phosphodiesters. Nucleobase 
resistant deoxyribonucleosides are defined analogously. In a preferred embodiment, the 
DMV preferably includes at least three and more preferably six nuclease resistant 
ribonucleosides. In one preferred embodiment the CMV contains only nuclease resistant 
ribonucleosides and deoxyribonucleotides. In an alternative preferred embodiment, every 
other ribonucleoside is nuclease resistant. 

Each DMV has a single 3' end and a single 5' end. In one embodiment the ends are 
the terminal nucleobases of a strand. In an alternative embodiment, a strand is divided into 
two chains that are linked covalently through the alternative strand but not directly to each 
other. In embodiments wherein a strand is divided into two chains, the 3' and 5' ends are 
Watson-Crick base paired to adjacent nucleobases of the alternative strand. In such strands, 
^ the 3' and 5* ends are not terminal nucleobases. A 3' end or 5' end that is not the terminal 
nucleobase of a strand can be optionally substituted with a steric protector from nuclease 
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activity as described above. In yet an alternative embodiment, a terminal nucleobase of a 
strand is attached to a nucleobase that is not paired to a corresponding nucleobase of the 
opposite strand and is not a part of an interstrand linker. Such embodiment has a single 
-hairpin" conformation with a 3' or 5' "overhang." The unpaired nucleobase and other 
components of the overhang are not regarded as a part of a strand. The overhang may 
5 include self-hybridized nucleobases or non-nucleobase moieties, e.g., affinity ligands or 
labels. In a particular preferred embodiment of DMV having a 3' overhang, the strand 
containing the 5' nucleobase is composed of deoxy-type nucleobases only, which are paired 
with nbo-type nucleobase of the opposite strand. In a yet further preferred embodiment of 
DMV having a 3' overhang, the sequence of the strand containing the 5' end nucleobase is 
1 ( ) the different, desired sequence and the sequence of the strand having the overhang is the 
sequence of the target DNA. 

A particularly preferred embodiment of the invention is a DMV wherein the two 
strands arc not fully complementary. Rather the sequence of one strand comprises the 
sequence of the target DNA to be modified and the sequence of the alternative strand 
1 5 comprises the different, desired sequence that the user intends to introduce in place of the 
target sequence. It follows that the location where the target and desired sequences differ, 
the bases of one strand are paired with non-complementary bases in the other strand. Such 
DMV arc termed herein Heteroduplex Mutational Vectors (HDMV). In one preferred 
embodiment, the desired sequence is the sequence of a chain of a divided strand. In a 
20 second preferred embodiment, the desired sequence is found on a chain or a strand that 

contains no ribo-type nucleobases. In a more preferred embodiment, the desired sequence is 
the sequence of a chain of a divided strand, which chain contains no ribo-type nucleobases. 
Internucleobase linkages 

The linkage between the nucleobases of the strands of a DMV can be any linkage 
25 that is compatible with the hybridization of the DMV to its target sequence. Such 

sequences include the conventional phosphodiester linkages found in natural nucleic acids. 
The organic solid phase synthesis of oligonucleotides having such nucleotides is described 
in U.S. Patent No. Re: 34,069. 

Alternatively, the internucleobase linkages can be substituted phosphodiesters, e.g., 
30 phosphorothioates, substituted phosphotriesters. Alternatively, non-phosphate, phosphorus- 
containing linkages can be used. U.S. Patent No. 5,476,925 to Letsinger describes 
phosphoramidate linkages. The 3 , -phosphoramidate linkage (3 , -NP(0')(0)0-5 l ) is well 
suited for use in DMV because it stabilizes hybridization compared to a 5'- 
phosphoramidate. Non-phosphate linkages between nucleobases can also be used. U.S. 
35 Patent No. 5,489,677 describes internucleobase linkages having adjacent N and O and 
methods of their synthesis. The linkage 3 , -ON(CH3)CH 2 -5' (methylenemethylimmino) is a 
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preferred embodiment. Other linkages suitable for use in DMV are described in U.S. Patent 
No. 5,731,181 to Kmiec. Nucleobases that lack a pentosefuranosyl moiety and are linked 
by peptide bonds can also be used in the invention. Oligonucleobases containing such so- 
called peptide nucleic acids (PNA) are described in U.S. Patent No. 5,539,082 to Nielsen. 
Methods for making PNA/nucleotide chimera are described in WO 95/14706. 

5 5.3 Specific Uses 

Heteroduplex Mutational Vectors of the invention and Non-chimeric Mutational 
Vectors of the invention can be used in any eukaryotic cell in the place of the prior art 
Chimeric Mutational Vectors. Patent publication WO 97/41 141 by Kmiec et al. teaches the 
use of Chimeric Mutational Vectors, ex vivo as do U.S. Patent No. 5,565,350 and U.S. 

10 Patent No. 5,731,181. Krenetal., 1998, Nature Medicine 4,285 provides guidance for the 
use of Chimeric Mutational Vectors in vivo. 

The recombinagenic oligonucleotides can be used in cell-free systems for several 
purposes, which will be apparent to those skilled in the art. Examples without limitation are 
as follows. 

1 ^ The effects of modification in the purity, chemistry, size and/or conformation of 

recombinagenic oligonucleotides can be rapidly and quantitatively tested in cell-free 
systems. The cell-free system has the further advantages that efficiency of recombination 
can be measured independently of the efficiency of delivery. 

The cell-free system can be used to test compounds that are intended to inhibit or 

20 

enhance the activity of the enzymes needed for chimeraplasty, in an alternative embodiment 
test for compounds that replace an enzyme of the mixture. Inhibitory compounds may be 
competitive or non-competitive inhibitors that act directly on the enzymes involved. 
Alternatively, the inhibitors can act on the cell from which an extract is made to block the 
synthesis or accelerate the degradation of an enzyme. These compounds may act by 

25 

inducing or suppressing the synthesis of the relevant enzymes or may act by inducing post- 
synthetic modifications that activate or inactivate the relevant enzymes. 

The cell-free system can be further used to test the relevance or particular proteins to 
the mechanism of chimeraplasty. Such testing can, for example without limitation be 

performed by use of protein-specific monoclonal antibodies to determine whether the 

30 • • 

protein m question is relevant to chimeraplasty. 

A further use of the cell-free system is the specific modification of plasmid, or other 

isolated DNA molecules. In one embodiment of use for this purpose, the recombinagenic 

oligonucleobase contains an affinity ligand, such as biotin, that allows the separation of the 

complex with the target DNA from the uncomplexed target DNA. The chimeraplasty 

^ reaction is, in this embodiment, performed using a separate strand transfer step and a 

mismatch repair step. This embodiment can be used to increase the proportion of modified 
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DNA targets, so that non-selectable modifications can be made without undue expenditure 
of material and effort in screening. In one embodiment, the receptor for the affinity ligand 
is bound to a solid phase particle so that the recombinagenic oligonucleobase/target DNA 
complex is attached to the particle. In the second stage of the reaction the mismatch repair 
activity results in the modification and release of the target DNA, whereby the supernatant 
of the second stage of the process is enriched for the modified plasmid. 

6. EXAMPLES 

Table I below shows the relative numbers of kanamycin and ampicillin resistant 
colonies using variants of Kany.y to correct a stop-codon causing CG transversion in the 
kan resistance gene. 

The following materials and methods were employed to obtain these data. 
Cell-Free Extracts: HuH-7 (Nakabayashi, H., et al, 1982, Cancer Res. 42, 3858) cells are 
grown in DMEM supplemented with 10% fetal bovine serum to mid log phase, about 5 x 
10 5 cells/ml. The cells are mechanically dislodged from the tissue culture flask and 
1 5 pelleted at 500xg. The pellet is washed in ice-cold Hypotonic Buffer with sucrose (20 mM 
HEPES, pH 7.5, 5 mM KC1, 1.5 mM MgCl 2 , 1 mM DTT, 250 mM sucrose), washed in ice- 
cold Hypotonic Buffer without sucrose and then resuspended in Hypotonic Buffer at 6.5 x 
10 7 cells/ml and incubated on ice for 15 min. Thereafter the cells are lysed using a Dounce 
homogenizes 3-5 strokes, and thereafter incubated a further 45 min on ice. The lysate is 

20 

cleared by centrifugation at 10,000xg for 10 min. and the supernatant aliquoted and stored 
at 

-80°C until use. 

Reaction Conditions: The cell-free enzyme mixture, plasmid and DMV are reacted in a 
final volume of 50 nl. The reaction buffer is 20 mM Tris, pH 7.4, 15 mM MgCl 2 , 0.4 mM 

~ 5 DTT, and 1.0 mM ATP. Plasmid, DMV and extract protein final concentrations are 20 
Hg/ml, 20 jig/ml and 600 ug/ml, respectively. The reaction is run in 500 fil "Eppendorf ' 
tubes. The tubes are prechilled on ice and the reagents added and mixed except for the 
extract. The extract is then added and the reaction incubated 45 min at 37°C. The reaction 
is stopped by chloroform/phenol extraction. The nucleic acid is precipitated with 10% (v/v) 

30 3M sodium acetate, pH 4.8 and 2 volumes of absolute EtOH, at -20°C. 

Bacterial Transformation: The precipitated, DMV-treated plasmid is dissolved and 
bacteria are transformed by electroporation according to standard techniques. After 
electroporation the bacteria are incubated for 1 hr in the absence of antibiotic (kanamycin) 
and then for 4 hours in the presence of 20% of the selective level of antibiotic 

35 

Anal V sis: The effectiveness of the DMV can be ascertained from the ratio of the 
kanamycin resistant colonies and the ampicillin resistant colonies, which is a measure of the 

- 15- 



BNSOOCID: <WO B9S8702A1J_> 



WO 99/58702 



PCT/US99/10519 



recovery of the plasmid and the efficiency of electroporation. The ratio given in the table 
below is based on data obtained after a 4 hour incubation with a sub-selective level of 
kanamycin. Such selective incubation results in an increase in kan r colonies of about 100 
fold. The absolute frequencies, which have been corrected for the pre-plating selection are 
reported. 

5 DMV: The general structure of a Duplex Mutational Vector for the introduction of 

kanamycin resistance is given below. The intervening segment, 3' homology region, and 5' 
homology region are designated "I", "H-3" and M H-5"\ respectively. The interstrand 
linkers are designated "L". An optional chi site (5 ! -GCTGGTGG-3') and its complement 
arc indicated as X and X' respectively. The 3' and 5' mutator region are single nucleotides 

1 0 indicated as M 3 and M 5 ', respectively. Variant I is similar to the Chimeric Mutational 
Vectors described in Cole-Strauss, 1996, Science 273, 1386, and Kren, 1998, Nature 
Medicine 4. 285-290. Variant I is referred to as Kany.y elsewhere in this specification. 
The symbol for a feature of a variant indicates that the feature of the variant is the same 
as variant I. 

1 5 The above DMV causes a CG transversion that converts a TAG stop codon into a 

TAC tyr codon. Note that the first strand of I lacks an exonuclease protected 3' terminus 
and that the second strand of I is a divided strand, the first chain of which is the desired, 
different sequence. Variants IV and V are a Chimeric Mutational Vector and a Non- 
Chimeric Mutational Vector, respectively, having 3' termini exonuclease protected by a 

20 nuclease resistant linker (2'OMe-U 4 ). Variants Via and VIb are Chimeric Heteroduplex 
Mutational Vectors. Variant VIb is the variant in which the desired, different sequence is 
found on the first chain, which chain consists of DNA-type nucleotides only. 

The tabic below gives the activities of the variants relative to the variant I in for a 
bacterial system and gives the frequency of conversion to kan7 10 5 plasmids for a cell-free 

25 extract. The background rates were negligible compared to the experimental values except 
for variant Via in the cell-free system and bacterial systems and variant VII in bacteria. 
The data reported for these variants are background corrected. Variants Via and VII show 
low or absent activity. Each of variants III-V are superior in both systems to variant I, 
which is of the type described in the scientific publications of Yoon, Cole-Strauss and Kren 

30 cited herein above. Variant VIII is the optimal chimera based on inference from these data. 
The results shows an excellent correlation between activity in the cell-free extract 
and activity in the bacterial system. In particular, in both systems variants IV and VIb are 
superior to Kany.y and in both systems the Non Chimeric Mutational Vectors are active. 
The only disparity is variant VII, which contains solely deoxynucleotides. Variant VII is 
35 active in the cell-free extract but not the bacterial system. Deoxyoligonucleotides have also 
been found inactive in eukaryotic cells. Without limitation as to theory, applicants believe 
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that the activity of variant VII in the cell-free system is due to the reduced amount of 
nucleases present in the system compared to cell-containing systems. In particular, 
applicants have found that a 5'-end labeled 46 nt single strand DNA was not degraded (< 

1 %) by the cell-free extract in a 10 min incubation at 37°C incubation. A like result was 
obtained with a 46 bp 5' end labeled linear duplex DNA substrate. The reaction buffer was 

2 mM ATP, 1 mM DTT, 25 mM Tris-Acetate, pH 7.15, 5 mM Mg. 
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The sequences of DMV for the introduction of tetracycline resistance is given below: 
TetA208T 



25 



H-3' 



I 



H-5' 



+ 



GCGCG-aaggcu gucg TA ACG guc agugau a 
T4 CGCGC TTCCGa'cAGC AT ' TGC CAg'tCACTA ' T^* 



1 SEQIDNo. 3 



30 



Tetl53 



35 



H-3' 



H-5' 



+ 



GCGCG-auccgu aucc GA ACC aau acggcc a 



CGCGC TAGGCA TAGG CT TGG TTA'TGCCGG'T 
3' 5' 



L T « 



SEQIDNo. 4 
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WE CLAIM: 

1 . A cell-free composition for the modification of a DNA sequence comprising: 

a. a duplex DNA containing a target sequence; 

b. a recombinagenic oligonucleobase. which targets the DNA sequence and 
encodes the modification thereof; and 

c. a cell-free enzyme mixture comprising a strand transfer activity. 

2. The composition of claim 1 , in which the oligonucleobase comprises at least 20 and 
not more than 200 nucleobases. 

3. The composition of claim 1 , in which the oligonucleobase comprises at least 10 and 
not more than 60 Watson-Crick nucleobase pairs. 

4. The composition of claim 1. in which the oligonucleobase comprises a single 3' end 
2 5 and a single 5' end. 

5. The composition of claim 1 , in which the duplex DNA comprises two closed 
circular DNA polymers. 

6. The composition of claim 1, in which the duplex DNA sequence is a portion of a 

20 gene-of-interest that is operably linked to a promoter, so that the gene-of-interest can 

be expressed in a host organism. 

7. The composition of claim 6. in which the cell-free enzyme mixture lacks mismatch 
repair activity. 

25 

8. The composition of claim 1 ; in which the strand transfer activity is provided by a 
eukaryote-derived enzyme. 



30 



9. The composition of claim 8, in which the cell-free enzyme mixture is a defined 
enzyme mixture with regard to a Rad51 mammalian homolog or a Rad52 
mammalian homolog. 

1 0. The composition of claim 8, in which the cell-free enzyme mixture is an extract of a 
eukaryotic cell. 

35 11. The composition of claim 1 0, in which the cell-free enzyme mixture is an extract of 
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a mammalian cell. 

12. The composition of claim 1 , in which the cell-free enzyme mixture further 
comprises a mismatch repair activity. 

5 

13. The composition of claim 12, in which the mismatch repair activity is provided by a 
eukaryote-derived enzyme. 



10 



14. The composition of claim 1 3, in which the cell-free enzyme mixture is an extract of 
a eukaryotic cell. 

15. The composition of claim 14, in which the cell-free enzyme mixture is an extract of 
a mammalian cell. 

16. The composition of claim 12. in which the strand transfer activity is provided by a 
1 5 eukaryote-derived enzyme. 

17. The composition of claim 16, in which the cell-free enzyme mixture is a eukaryotic 
cell extract. 

18. The composition of claim 1, in which the recombinagenic oligonucleobase is a 
^ duplex mutational vector comprising: 

a. a first oligonucleobase strand of at least 12 linked nucleobases and not more 
than 75 linked nucleobases, which strand has a first and a second terminal 
nucleobase; 

b. a second oligonucleobase strand having an equal number of nucleobases as 
25 the first strand, which strand is optionally divided into a first chain and a 

second chain; and 

c. a 3' end nucleobase and a 5 f end nucleobase; 
in which 

i. the 3' most and 5' most nucleobases of the second strand are Watson- 
Crick base paired to the first terminal and the second terminal 
nucleobase, respectively, and 

ii. the second strand contains at least two non-overlapping regions of at 
least 5 contiguous nucleobases that are Watson-Crick base paired to 
nucleobases of the first strand; 

wherein the sequence of at least one strand comprises the different, desired 

35 
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sequence. 

19. A method of modifying a site of a gene-of- interest which comprises the steps of: 

a. reacting 

5 i. a recombinagenic oligonucleobase, that encodes a modification of a 

gene-of-interest, 

ii. a duplex DNA molecule containing the gene-of-interest operably 

linked to a promoter, so that the gene of interest can be expressed in a 
host organism, and 

in. a cell-free enzyme mixture comprising a strand transfer activity and a 
1 0 mismatch repair activity ; 

whereby the gene-of-interest is modified at the target site; 

b. introducing the modified gene-of-interest into the organism; and 

c. detecting the expression of the modified gene-of-interest. 

1 5 20. The method of claim 1 9, wherein the oligonucleobase comprises at least 20 and not 
more than 200 nucleobases. 

21. The method of claim 19, wherein the oligonucleobase comprises at least 10 and not 
more than 60 Watson-Crick nucleobase pairs. 

20 

22. The method of claim 1 9, wherein the oligonucleobase comprises a single 3* end and 
a single 5' end. 

23. The method of claim 19 ? wherein the duplex DNA comprises two closed circular 
DNA polymers. 

25 

24. The method of claim 1 9, wherein the expression of the modified gene-of-interest 
confers a selectable trait on the organism. 

25. The method of claim 19, wherein the expression of the modified gene-of-interest 
30 confers an observable trait on the organism. 

26. A method of altering a DNA sequence, which comprises the steps of: 
a. reacting 

i. a recombinagenic oligonucleobase, that encodes a modification of a 
^ DNA sequence, 
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ii. a duplex DNA molecule containing the sequence, and 

Hi. a cell-free enzyme mixture comprising a strand transfer activity and a 

mismatch repair activity; 
whereby the sequence is modified; 
5 b. detecting the modified sequence. 

27. The method of claim 26, which further comprises fractionating a cell-free 
composition so as to enrich the modified duplex DNA relative to the unmodified 
duplex DNA, prior to detecting the modified sequence. 

28. The method of claim 26, wherein the oligonucleobase comprises at least 20 and not 
more than 200 nucleobases. 

29. The method of claim 26, wherein, the oligonucleobase comprises at least 10 and not 
more than 60 Watson-Crick nucleobase pairs. 

15 

30. The method of claim 26, wherein the oligonucleobase comprises a single 3' end and 
a single 5* end. 

3 1 . The method of claim 26, in which the recombinagenic oligonucleobase is a duplex 
mutational vector comprising: 

a. a first oligonucleobase strand of at least 12 linked nucleobases and not more 
than 75 linked nucleobases, which strand has a first and a second terminal 
nucleobase; 

b. a second oligonucleobase strand having an equal number of nucleobases as 
the first strand, which strand is optionally divided into a first chain and a 
second chain; and 

c. a 3' end nucleobase and a 5' end nucleobase; 
in which 

i. the 3' most and 5' most nucleobases of the second strand are Watson- 
Crick base paired to the first terminal and the second terminal 
nucleobase, respectively, and 

ii. the second strand contains at least two non-overlapping regions of at 
least 5 contiguous nucleobases that are Watson-Crick base paired to 
nucleobases of the first strand; 

wherein the sequence of at least one strand comprises the different, desired 
sequence. 
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32. A method of transforming a target DN A sequence into a different, desired sequence 
in a eukaryotic cell that comprises (A) administering to the cell a duplex mutational 
vector comprising: 

a. a first oligonucleobase strand of at least 1 2 linked nucleobases and not more 
than 75 linked nucleobases. which strand has a first terminal nucleobase and 
a second terminal nucleobase; 

b. a second oligonucleobase strand having a 3' most nucleobase and a 5' most 
nucleobase and having a number of nucleobases equal to the first strand, 
which second strand is optionally divided into a first chain and a second 
chain; and 

10 c. a 3' end nucleobase and a 5' end nucleobase; 

in which 

i. the 3* most and 5' most nucleobases of the second strand are Watson- 
Crick base paired to the first terminal and the second terminal 
nucleobase of the first strand, respectively, 
1 5 ii. said 3' most nucleobase and said second terminal nucleobase are 

protected from 3 1 exonuclease attack, and 
iii. the second strand contains at least two non-overlapping regions of at 
least 5 contiguous nucleobases that are Watson-Crick base paired to 
nucleobases of the first strand; 
provided that there are not more than two contiguous Watson-Crick base pairs 
compnsed of a ribo-type and a deoxyribo-type nucleobase; and 
(B) detecting DNA from or in the cell or the progeny thereof having the different, 
desired sequence. 

The method of claim 32, wherein each nucleobase of the first strand is Watson- 
Crick paired to a complementary nucleobase of the second strand. 

The method of claim 32, wherein the sequence of the first strand comprises the 
sequence of the different, desired sequence. 

The method of claim 32, wherein the first terminal nucleobase and the 3' most 
nucleobase are linked by a linker comprising a moiety selected from the group 
consisting of 2'-methoxy-uridine, 2'-allyloxy-uridine, 2 , -fluoro-uridine, 2'-methoxy- 
thymidine, 2'-allyloxy-thymidine, 2'-fluoro-thymidine, polyethylene glycol and 
trans-4,4'-stilbenecarboxamide. 
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36. The method of claim 32, wherein the second terminal nucleobase and the 5' most 
nucleobase are linked by a linker comprising a moiety selected from the group 
consisting of 2'-methoxy-uridine, 2'-aIlyIoxy-uridine, 2'-fluoro-uridine ? 2'-methoxy- 
thymidine, 2 , -allyloxy-thymidine, 2'-fluoro-thymidine, polyethylene glycol and 
trans-^-stilbenecarboxamide. 

37. The method of claim 32, wherein the second strand is comprised of a first chain and 
a second chain and the first chain contains no ribo-type nucleobases. 

38. The method of claim 37, wherein each nucleobase of the first strand is Watson-Crick 
paired to a complementary nucleobase of the second strand. 

39. The method of claim 37, wherein the sequence of the different, desired sequence is 
the sequence of the first chain. 

5 40. The method of claim 32, wherein the first chain comprises the 5* end nucleobase. 

41 . The method of claim 32, wherein the first chain comprises the 3' end nucleobase. 

42. A method of transforming a target DNA sequence into a different, desired sequence 
in a eukaryotic cell that comprises (A) administering to the cell a duplex mutational 
vector comprising: 

a. a first oligonucleobase strand of at least 12 linked nucleobases and not more 
than 75 linked nucleobases, which strand has a first and a second terminal 
nucleobase; 

b. a second oligonucleobase strand having a 3' most and a 5' most nucleobase 
and having a number of nucleobases equal to the first strand, which second 
strand is optionally divided into a first chain and a second chain; and 

c. a single 3* end nucleobase and a single 5' end nucleobase; 
in which 

i. the 3' most and 5' most nucleobases of the second strand are Watson- 
Crick base paired to the first terminal and the second terminal 
nucleobase of the first strand, respectively, and 

ii. the second strand contains at least two non-overlapping regions of at 
least 5 contiguous nucleobases that are Watson-Crick base paired to 
nucleobases of the first strand; 

wherein the sequence of the first strand comprises the sequence of the target DNA 
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and the sequence of the second strand comprises the different, desired sequence; and 
(B) detecting the presence in the cell or the progeny thereof of the DNA having the 
different, desired sequence. 

43. The method of claim 42, wherein the first strand comprises at least 12 ribo-type 
nucleobases. 

44. The method of claim 42, wherein the second strand is divided into a first chain and a 
second chain. 

45. The method of claim 44, wherein the sequence of the first chain is the different, 
desired sequence and the first chain contains no ribo-type nucleobases. 

46. The method of claim 45, wherein the first chain comprises the 5' end nucleobase. 

47. The method of claim 45, wherein the first chain comprises the 3* end nucleobase. 

48. A method of transforming a target DNA sequence into a different, desired sequence 
in a eukaryotic cell that comprises: 

(A) administering to the cell a duplex mutational vector comprising: 

a. a first oligonucleobase strand of at least 12 linked nucleobases and not more 
than 75 linked nucleobases, which strand has a first and a second terminal 
nucleobase; 

b. a second oligonucleobase strand having a 3' most and a 5 f most nucleobase 
and having a number of nucleobases equal to the first strand, which second 
strand is optionally divided into a first chain and a second chain; and 

c. a single 3' end nucleobase and a single 5' end nucleobase, 
in which 

i. the 3' most and 5 ? most nucleobases of the second strand are Watson- 
Crick base paired to the first terminal and the second terminal 
nucleobase of the first strand, respectively, and 
) ii. the second strand contains at least two non-overlapping regions of at 

least 5 contiguous nucleobases that are Watson-Crick base paired to 
nucleobases of the first strand; 
wherein the sequence of a strand comprises the sequence of the target DNA and the 
sequence of a strand comprises the sequence of the different, desired sequence and 
the oligonucleobase segment having said different, desired sequence is comprised 
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of at least 12 contiguous deoxyribo-type nucleobases; and 

(B) detecting the presence in the cell or the progeny thereof of the DNA having the 

different, desired sequence. 

5 49. The method of claim 48, wherein the sequence of the first strand comprises the 
sequence of the target DNA. 

50. The method of claim 48, wherein the sequence of the first strand comprises the 
sequence of the different, desired sequence. 

51. The method of claim 48, wherein the second strand is comprised of a first chain and 
a second chain and the first chain contains no ribo-type nucleobases. 

52. The method of claim 5 1 , wherein the sequence of the target DNA is the sequence of 
the first chain. 

15 

53. The method of claim 51, wherein the sequence of the different, desired sequence is 
the sequence of the first chain. 

54. A method of transforming a target DNA sequence into a different, desired sequence 
2o in a eukaryotic cell that comprises: 

(A) administering to the cell a chimeric duplex mutational vector comprising: 

a. a first oligonucleobase strand of at least 12 linked nucleobases and not more 
than 75 linked nucleobases, which strand has a first terminal and a second 
terminal nucleobase; 

b. a second oligonucleobase strand having a 3' most nucleobase and a 5' most 
nucleobase and having a number of nucleobases equal to the first strand, 
which second strand is divided into a first chain and a second chain; and 

c. a y end nucleobase and a 5' end nucleobase; 
in which 

i. the 3' most and 5' most nucleobases of the second strand are Watson- 
30 Crick base paired to the first terminal and the second terminal 

nucleobase of the first strand, respectively, 

ii. the nucleobases of the first chain are deoxy-type nucleobases and 
nucleobases of the first strand paired therewith are nuclease resistant 
ribo-type nucleobases; 

35 i«- the second strand contains at least two non-overlapping regions of at 
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least 5 contiguous nucleobases that are Watson-Crick base paired to 

nucleobases of the first strand, 
wherein the sequence of a strand comprises the sequence of the target DNA and the 
sequence of a strand comprises the sequence of the different, desired sequence and 
5 the oligonucleobase segment having said different, desired sequence is comprised of 

at least 12 contiguous deoxyribo-type nucleobases; and 

(B) detecting the presence in the cell or the progeny thereof of the DNA having the 
different, desired sequence. 

55. The method of claim 54, wherein the first chain comprises the 5* end nucleobase. 

10 

56. The method of claim 55, wherein not more than one nucleobase of the first chain is 
paired with a non-complementary nucleobase of the first strand. 

57. A method of transforming a target DNA sequence into a different, desired sequence 
15 in a eukaryotic cell that comprises: 

(A) administering to the cell a chimeric duplex mutational vector comprising: 

a. a oligonucleobase strand of at least 12 linked nucleobases and not more than 
75 linked nucleobases, which strand has a first terminal and a second 
terminal nucleobase; and 

b. a oligonucleobase chain having a 3' most nucleobase and a 5' end 
nucleobase; and 

c. a 3 r overhang attached to the second terminal nucleobase, 
in which 

i. the 3 f most and 5' end nucleobases of the chain are Watson-Crick 
base paired to the first terminal and the second terminal nucleobase of 

25 the first strand, respectively, 

ii. the nucleobases of the chain are deoxy-type nucleobases and 
nucleobases of the strand paired therewith are nuclease resistant ribo- 
type nucleobases; 

iii. the second strand contains at least two non-overlapping regions of at 
30 least 5 contiguous nucleobases that are Watson-Crick base paired to 

nucleobases of the first strand, 
wherein the sequence of the chain comprises the sequence of the different, desired 
sequence; and 

(B) detecting the presence in the cell or the progeny thereof of the DNA having the 
35 different, desired sequence. 
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58. The method of claim 57, wherein the sequence of the strand comprises the sequence 
of the target DNA. 

59. The method of claim 58, wherein not more than one nucleobase of the first chain is 
5 paired with a non-complementary nucleobase of the first strand. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Kmiec, Eric B. 

Gamper, Howard B. 
Cole-Strauss, Allyson D. 

5 (ii) TITLE OF THE INVENTION: EUKARYOTIC USE OF NON-CHIMERIC 

MUTATIONAL VECTORS CHIMERIC 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Kimeragen, Inc. 

(B) STREET: 3 00 Pheasant Run 

(C) CITY: Newtown 
10 (D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 18940 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 
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(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

<B) FILING DATE: 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Hansburg, Daniel 

(B) REGISTRATION NUMBER: 36156 

(C) REFERENCE/DOCKET NUMBER: 7991-035-999 

(ix) TELECOMMUNICATION INFORMATION* 
(A) TELEPHONE: 215-504-4444 

25 (B) TELEFAX: 215-504-4545 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 
30 <C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GcSSSgC SSSSS St?* 0 ™ TGGTTTTCCA ^TO-ro CCCAOTCSTA SO 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GCTATTCGGC TASGACTGGG CACAATTTTT TGTGCCCAGT CSTAGCCGAA TAGCGCGCGT 60 
TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

TTCCGACAGC ATTGCCAGTC ACTATTTTTA TAGTGACTGG CAATGCTGTC GGAAGCGCGT 60 
15 TTTCGCGC 68 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 68 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TAGGCATAGG CTTGGTTATG CCGGTTTTTA CCGGCATAAC CAAGCCTATG CCTAGCGCGT 60 
TTTCGCGC 68 
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